Asynchronous progressive irregular prefix operation in HPF2
نویسندگان
چکیده
In this paper, we study one kind of irregular computation on distributed arrays, the irregular prefix operation, that is currently not well taken into account by the standard data-parallel language HPF2. We show a parallel implementation that efficiently takes advantage of the independent computations arising in this irregular operation. Our approach is based on the use of a directive which characterizes an irregular prefix operation and on inspector/executor support, implemented in the CoLuMBO library, which optimizes the execution by using an asynchronous communication scheme and then communication/computation overlap. We validate our contribution with results achieved on IBM SP2 for basic experiments and for a sparse Cholesky factorization algorithm applied to real size problems.
منابع مشابه
Contribution to Better Handling of Irregular Problems in HPF2
In this paper, we present our contribution for handling irregular applications with HPF2 and some experimental results. We propose a programming style of irregular applications close to the regular case, so that both compile-time and run-time techniques can be more easily performed. We use the well-known tree data structure to represent irregular data structures with hierarchical access, such a...
متن کاملContention-Free Communication Scheduling for Irregular Data Redistribution in Parallelizing Compilers
The data redistribution problems on multi-computers had been extensively studied. Irregular data redistribution has been paid attention recently since it can distribute different size of data segment of each processor to processors according to their own computation capability. High Performance Fortran Version 2 (HPF-2) provides GEN_BLOCK data distribution method for generating irregular data d...
متن کاملMessage Clustering Technique towards Efficient Irregular Data Redistribution in Clusters and Grids
Efficient scheduling algorithms are essential to irregular data redistribution in cluster grid. Cluster grid is an environment with heterogeneous computing nodes and complex network. It is important for schedulers to keep an eye on load balance and low communication cost while distributing different size of data segment on various processors. High Performance Fortran Version 2 (HPF2) provides G...
متن کاملCurrency Reform in Turkey and Lessons from Re-Denomination and Euro Changeover
Re-denomination was not only a zero dropping and new coin minting operation but also a significant milestone in Turkey. Before the currency reform, multiple zeros had posed several difficulties in expressing monetary values, transactions, bookkeeping and statistical records, data processing software, payment systems, price labels etc. So, removing six zeros from national currency became a ...
متن کاملAn Efficient Scheduling Algorithm for Irregular Data Redistribution
Dynamic data redistribution is used to enhance the performance of an algorithm and to achieve data locality in parallel programs on distributed memory multi-computers. The data redistribution problem has been extensively studied. Previous results focus on reducing index computational cost, schedule computational cost, and message packing/unpacking cost. However, irregular data redistribution is...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000